NSF PAR Search | NSF Public Access Repository


Locating Information Gaps and Narrative Inconsistencies Across Languages: A Case Study of LGBT People Portrayals on Wikipedia

Samir, Farhan; Park, Chan Young; Field, Anjalie; Shwartz, Vered; Tsvetkov, Yulia (December 2024, EMNLP)

Full Text Available
Examining risks of racial biases in NLP tools for child protective services

https://doi.org/10.1145/3593013.3594094

Field, Anjalie; Coston, Amanda; Gandhi, Nupoor; Chouldechova, Alexandra; Putnam-Hornstein, Emily; Steier, David; Tsvetkov, Yulia (June 2023, FAccT '23: Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency)

Although much literature has established the presence of demographic bias in natural language processing (NLP) models, most work relies on curated bias metrics that may not be reflective of real-world applications. At the same time, practitioners are increasingly using algorithmic tools in high-stakes settings, with particular recent interest in NLP. In this work, we focus on one such setting: child protective services (CPS). CPS workers often write copious free-form text notes about families they are working with, and CPS agencies are actively seeking to deploy NLP models to leverage these data. Given well-established racial bias in this setting, we investigate possible ways deployed NLP is liable to increase racial disparities. We specifically examine word statistics within notes and algorithmic fairness in risk prediction, coreference resolution, and named entity recognition (NER). We document consistent algorithmic unfairness in NER models, possible algorithmic unfairness in coreference resolution models, and little evidence of exacerbated racial bias in risk prediction. While there is existing pronounced criticism of risk prediction, our results expose previously undocumented risks of racial bias in realistic information extraction systems, highlighting potential concerns in deploying them, even though they may appear more benign. Our work serves as a rare realistic examination of NLP algorithmic fairness in a potential deployed setting and a timely investigation of a specific risk associated with deploying NLP in CPS settings.
Full Text Available
Challenges and Opportunities in Information Manipulation Detection: An Examination of Wartime Russian Media

Park, Chan Young; Mendelsohn, Julia; Field, Anjalie; Tsvetkov, Yulia (December 2022, Empirical Methods in Natural Language Processing)

NLP research on public opinion manipulation campaigns has primarily focused on detecting overt strategies such as fake news and disinformation. However, information manipulation in the ongoing Russia-Ukraine war exemplifies how governments and media also employ more nuanced strategies. We release a new dataset, VoynaSlov, containing 38M+ posts from Russian media outlets on Twitter and VKontakte, as well as public activity and responses, immediately preceding and during the 2022 Russia-Ukraine war. We apply standard and recently developed NLP models on VoynaSlov to examine agenda setting, framing, and priming, several strategies underlying information manipulation, and reveal variation across media outlet control, social media platform, and time. Our examination of these media effects and extensive discussion of current approaches’ limitations encourage further development of NLP models for understanding information manipulation in emerging crises, as well as other real-world and interdisciplinary tasks.
Full Text Available
Gendered Mental Health Stigma in Masked Language Models

Lin, Inna; Njoo, Lucille; Field, Anjalie; Sharma, Ashish; Reinecke, Katharina; Althoff, Tim; Tsvetkov, Yulia (December 2022, Empirical Methods in Natural Language)

Mental health stigma prevents many individuals from receiving the appropriate care, and social psychology studies have shown that mental health tends to be overlooked in men. In this work, we investigate gendered mental health stigma in masked language models. In doing so, we operationalize mental health stigma by developing a framework grounded in psychology research: we use clinical psychology literature to curate prompts, then evaluate the models’ propensity to generate gendered words. We find that masked language models capture societal stigma about gender in mental health: models are consistently more likely to predict female subjects than male in sentences about having a mental health condition (32% vs. 19%), and this disparity is exacerbated for sentences that indicate treatment-seeking behavior. Furthermore, we find that different models capture dimensions of stigma differently for men and women, associating stereotypes like anger, blame, and pity more with women with mental health conditions than with men. In showing the complex nuances of models’ gendered mental health stigma, we demonstrate that context and overlapping dimensions of identity are important considerations when assessing computational models’ social biases.
Full Text Available
An analysis of emotions and the prominence of positivity in #BlackLivesMatter tweets

https://doi.org/10.1073/pnas.2205767119

Field, Anjalie; Park, Chan Young; Theophilo, Antonio; Watson-Daniels, Jamelle; Tsvetkov, Yulia (August 2022, Proceedings of the National Academy of Sciences)

Emotions are a central driving force of activism; they motivate participation in movements and encourage sustained involvement. We use natural language processing techniques to analyze emotions expressed or solicited in tweets about 2020 Black Lives Matter protests. Traditional off-the-shelf emotion analysis tools often fail to generalize to new datasets and are unable to adapt to how social movements can raise new ideas and perspectives in short time spans. Instead, we use a few-shot domain adaptation approach for measuring emotions perceived in this specific domain: tweets about protests in May 2020 following the death of George Floyd. While our analysis identifies high levels of expressed anger and disgust across overall posts, it additionally reveals the prominence of positive emotions (encompassing, e.g., pride, hope, and optimism), which are more prevalent in tweets with explicit pro-BlackLivesMatter hashtags and correlated with on-the-ground protests. The prevalence of positivity contradicts stereotypical portrayals of protesters as primarily perpetuating anger and outrage. Our work offers data, analyses, and methods to support investigations of online activism and the role of emotions in social movements.
Full Text Available
Gendered Mental Health Stigma in Masked Language Models

https://doi.org/10.18653/v1/2022.emnlp-main.139

Lin, Inna; Njoo, Lucille; Field, Anjalie; Sharma, Ashish; Reinecke, Katharina; Althoff, Tim; Tsvetkov, Yulia (January 2022, EMNLP: Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing)

Full Text Available
A Survey of Race, Racism, and Anti-Racism in NLP

https://doi.org/10.18653/v1/2021.acl-long.149

Field, Anjalie; Blodgett, Su Lin; Waseem, Zeerak; Tsvetkov, Yulia (August 2021, Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers))

Full Text Available
Multilingual Contextual Affective Analysis of LGBT People Portrayals in Wikipedia

Park, Chan Young; Yan, Xinru; Field, Anjalie; Tsvetkov, Yulia (January 2021, Proceedings of the International AAAI Conference on Web and Social Media)

Full Text Available
Entity-Centric Contextual Affective Analysis

Field, Anjalie; Tsvetkov, Yulia (August 2019, 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019))

While contextualized word representations have improved state-of-the-art benchmarks in many NLP tasks, their potential usefulness for social-oriented tasks remains largely unexplored. We show how contextualized word embeddings can be used to capture affect dimensions in portrayals of people. We evaluate our methodology quantitatively, on held-out affect lexicons, and qualitatively, through case examples. We find that contextualized word representations do encode meaningful affect information, but they are heavily biased towards their training data, which limits their usefulness to in-domain analyses. We ultimately use our method to examine differences in portrayals of men and women.
Full Text Available
Contextual Affective Analysis: A Case Study of People Portrayals in Online #MeToo Stories

Field, Anjalie; Bhat, Gayatri; Tsvetkov, Yulia (June 2019, The 13th International AAAI Conference on Web and Social Media (ICWSM-2019))

In October 2017, numerous women accused producer Harvey Weinstein of sexual harassment. Their stories encouraged other women to voice allegations of sexual harassment against many high profile men, including politicians, actors, and producers. These events are broadly referred to as the #MeToo movement, named for the use of the hashtag “#metoo” on social media platforms like Twitter and Facebook. The movement has widely been referred to as “empowering” because it has amplified the voices of previously unheard women over those of traditionally powerful men. In this work, we investigate dynamics of sentiment, power and agency in online media coverage of these events. Using a corpus of online media articles about the #MeToo movement, we present a contextual affective analysis—an entity-centric approach that uses contextualized lexicons to examine how people are portrayed in media articles. We show that while these articles are sympathetic towards women who have experienced sexual harassment, they consistently present men as most powerful, even after sexual assault allegations. While we focus on media coverage of the #MeToo movement, our method for contextual affective analysis readily generalizes to other domains.
Full Text Available
